3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
?
Size:
12000000 entries Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Medical Concept Embeddings via Labeled Background Corpora
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Eneldo Loza Mencía | Knowledge Engineering Group, TU Darmstadt | DE |
| Author 2 | Gerard de Melo | Tsinghua University | CN |
| Author 3 | Jinseok Nam | Knowledge Discovery in Scientific Literature, TU Darmstadt | DE |
| Main Contact | Eneldo Loza Mencía | Knowledge Engineering Group, TU Darmstadt | None |
Documentation:
http://www.biomedcentral.com/content/pdf/s12859-015-0564-6.pdfLanguage Type:
Multilingual
Languages:
English french
Availability:
ELRA
License:
ELRA
Size:
<Not Specified> Production Status:
<Not Specified>
Use:
Comparability measure assement
-
Paper title:Variations on quantitative comparability measures and their evaluations on synthetic French-English comparable corpora
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Guiyao Ke | Université de Bretagne Sud | FR | Universite de Bretagne Sud, IRISA | FR |
| Author 2 | Pierre-Francois Marteau | Universite de Bretagne Sud, IRISA | FR | ||
| Author 3 | Gildas Menier | Universite de Bretagne Sud, IRISA | FR | ||
| Main Contact | Pierre-Francois Marteau | Universite de Bretagne Sud, IRISA | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Not Available
License:
Creative Commons
Size:
528 KByte Production Status:
Newly created-finished
Use:
Discourse
-
Paper title:Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | James Ravenscroft | University of Warwick | GB |
| Author 2 | Anika Oellrich | King's College London | GB |
| Author 3 | Shyamasree Saha | Queen's College London | GB |
| Author 4 | Maria Liakata | University of Warwick | GB |
| Main Contact | Maria Liakata | University of Warwick | None |
Documentation:
Guidelines of the annotation of multiple core scientific concepts.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
495 GByte Production Status:
Snapshot of a resource available via the National Library of Australia
Use:
Information Extraction, Information Retrieval
-
Paper title:Publishing the Trove Newspaper Corpus
-
Paper track:Written
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Steve Cassidy | Department of Computing, Macquarie University | AU |
| Main Contact | Steve Cassidy | Department of Computing, Macquarie University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English italian
Availability:
Freely available for research purposes
License:
<Not Specified>
Size:
1050 sentences Production Status:
Newly created-finished
Use:
Metaphor Identification and Analysis of Cultural Differences
-
Paper title:PROMETHEUS: A Corpus of Proverbs Annotated with Metaphors
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Gözde Özbal | FBK-irst | IT |
| Author 2 | Carlo Strapparava | FBK-irst | IT |
| Author 3 | Serra Sinem Tekiroglu | University of Trento, Fondazione Bruno Kessler | IT |
| Main Contact | Gözde Özbal | FBK-irst | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
4000 sentences Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Building A Case-based Semantic English-Chinese Parallel Treebank
-
Paper track:Infrastructural Issues/Large Projects
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | huaxing shi | Harbin Institute of Technology | CN |
| Author 2 | Tiejun Zhao | Harbin Institute of Technology | CN |
| Author 3 | Keh-Yih Su | Institute of Information Science, Academia Sinica | TW |
| Main Contact | huaxing shi | Harbin Institute of Technology | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
7500 sentences Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Siamese CBOW: Optimizing Word Embeddings for Sentence Representations
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Poster - Monday
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Tom Kenter | University of Amsterdam | NL |
| Author 2 | Alexey Borisov | University of Amsterdam, Yandex | NL |
| Author 3 | Maarten de Rijke | University of Amsterdam | NL |
| Main Contact | Tom Kenter | University of Amsterdam | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English italian
Availability:
From Owner
License:
<Not Specified>
Size:
222.5 MByte Production Status:
Newly created-in progress
Use:
Summarisation
-
Paper title:The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Mijail Kabadjov | University of Essex | GB | ||
| Author 2 | Udo Kruschwitz | University of Essex | GB | University of Essex | None |
| Author 3 | Massimo Poesio | University of Essex | GB | ||
| Author 4 | Josef Steinberger | University of West Bohemia | GB | ||
| Author 5 | Jorge Valderrama | Websays | ES | ||
| Author 6 | Hugo Zaragoza | Websays | ES | ||
| Main Contact | Mijail Kabadjov | University of Essex | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
50 questions, with four choices each OtherProduction Status:
Existing-used
Use:
Knowledge Discovery/Representation
-
Paper title:What a Nerd! Beating Students and Vector Cosine in the ESL and TOEFL Datasets
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Enrico Santus | The Hong Kong Polytechnic University | HK |
| Author 2 | Alessandro Lenci | University of Pisa | IT |
| Author 3 | Tin-Shing Chiu | The Hong Kong Polytechnic University | HK |
| Author 4 | Qin Lu | The Hong Kong Polytechnic University | HK |
| Author 5 | Chu-Ren Huang | The Hong Kong Polytechnic University | HK |
| Main Contact | Enrico Santus | The Hong Kong Polytechnic University | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Open Directory License
Size:
4M websites OtherProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Thematic Cohesion: measuring terms discriminatory power toward themes
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Clément de Groc | Syllabs | FR |
| Author 2 | Xavier Tannier | LIMSI-CNRS, Univ. Paris-Sud | FR |
| Author 3 | Claude de Loupy | Syllabs | FR |
| Main Contact | Clément de Groc | Syllabs | None |
Documentation:
Yes, in english. Available at http://www.dmoz.org/docs/en/about.html




